Observation points classifier ensemble for high‐dimensional imbalanced classification
نویسندگان
چکیده
In this paper, an Observation Points Classifier Ensemble (OPCE) algorithm is proposed to deal with High-Dimensional Imbalanced Classification (HDIC) problems based on data processed using the Multi-Dimensional Scaling (MDS) feature extraction technique. First, dimensionality of original imbalanced reduced MDS so that distances between any two different samples are preserved as well possible. Second, a novel OPCE applied classify by placing optimised observation points in low-dimensional space. Third, optimization point mappings carried out obtain reliable assessment unknown samples. Exhaustive experiments have been conducted evaluate feasibility, rationality, and effectiveness seven benchmark HDIC sets. Experimental results show (1) can be trained faster than high-dimensional data; (2) correctly identify number increased; (3) statistical analysis reveals yields better performances selected sets comparison eight other algorithms. This demonstrates viable problems.
منابع مشابه
Ensemble Classifier for Benign-Malignant Mass Classification
Mammography is currently the most effective imaging modality for early detection of breast cancer. In a CAD system for masses based on mammography, a mammogram is segmented to detect the masses. The segmentation gives rise to mass regions of interested (ROIs), which are either benign or malignant. There is a need to classify the extracted mass ROIs into benign and malignant masses; it is a hard...
متن کاملClassifier Ensemble for Uncertain Data Stream Classification
Currently available algorithms for data stream classification are all designed to handle precise data, while data with uncertainty or imperfection is quite natural and widely seen in real-life applications. Uncertainty can arise in attribute values as well as in class values. In this paper, we focus on the classification of streaming data that has different degrees of uncertainty within class v...
متن کاملEnsemble Approach for the Classification of Imbalanced Data
Ensembles are often capable of greater prediction accuracy than any of their individual members. As a consequence of the diversity between individual base-learners, an ensemble will not suffer from overfitting. On the other hand, in many cases we are dealing with imbalanced data and a classifier which was built using all data has tendency to ignore minority class. As a solution to the problem, ...
متن کاملEnsemble Learning for Imbalanced E-commerce Transaction Anomaly Classification
This paper presents the main results of our on-going work, one month before the deadline, on the 2009 UC San Diego data mining contest. The tasks of the contest are to rank the samples in two e-commerce transaction anomaly datasets according to the probability each sample has a positive label. The performance is evaluated by the lift at 20% on the probability of the two datasets. A main difficu...
متن کاملWeighted Neighborhood Classifier for the Classification of Imbalanced Tumor Dataset
Machine learning is widely applied to gene expression pro ̄les based molecular tumor classi ̄cation, but sample imbalance problem is often overlooked. This paper proposed a subclassweighted neighborhood classi ̄er to address the imbalanced sample set problem and a novel neighborhood rough set model to select informative genes for classi ̄cation performance improvement. Experiments on three publicly...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: CAAI Transactions on Intelligence Technology
سال: 2022
ISSN: ['2468-2322', '2468-6557']
DOI: https://doi.org/10.1049/cit2.12100